538 research outputs found
On the use of hierarchical subtrace mining for efficient local process model mining
Mining local patterns of process behavior is a vital tool for the analysis of event data that originates from flexible processes, for which it is generally not possible to describe the behavior of the process in a single process model without overgeneralizing the behavior allowed by the process. Several techniques for mining such local patterns have been developed throughout the years, including Local Process Model (LPM) mining and the hierarchical mining of frequent subtraces (i.e., subprocesses). These two techniques can be considered to be orthogonal, i.e., they provide different types of insights on the behavior observed in an event log. As a consequence, it is often useful to apply both techniques to the data. However, both techniques can be computationally intensive, hindering data analysis. In this work, we explore how the output of a subtrace mining approach can be used to mine LPMs more efficiently. We show on a collection of real-life event logs that exploiting the ordering constraints extracted from subtraces lowers the computation time needed for LPM mining compared to state-of-the-art techniques, while at the same time mining higher quality LPMs. Additionally, by mining LPMs from subtraces, we can obtain a more structured and meaningful representation of subprocesses allowing for classic process-flow constructs such as parallel ordering, choices, and loops, besides the precedence relations shown by subtraces.</p
On generation of time-based label refinements
Process mining is a research field focused on the analysis of event data with
the aim of extracting insights in processes. Applying process mining techniques
on data from smart home environments has the potential to provide valuable
insights in (un)healthy habits and to contribute to ambient assisted living
solutions. Finding the right event labels to enable application of process
mining techniques is however far from trivial, as simply using the triggering
sensor as the label for sensor events results in uninformative models that
allow for too much behavior (overgeneralizing). Refinements of sensor level
event labels suggested by domain experts have shown to enable discovery of more
precise and insightful process models. However, there exist no automated
approach to generate refinements of event labels in the context of process
mining. In this paper we propose a framework for automated generation of label
refinements based on the time attribute of events. We show on a case study with
real life smart home event data that behaviorally more specific, and therefore
more insightful, process models can be found by using automatically generated
refined labels in process discovery.Comment: Accepted at CS&P workshop 2016 Overlap in preliminaries with
arXiv:1606.0725
Classifying Process Instances Using Recurrent Neural Networks
Process Mining consists of techniques where logs created by operative systems are transformed into process models. In process mining tools it is often desired to be able to classify ongoing process instances, e.g., to predict how long the process will still require to complete, or to classify process instances to different classes based only on the activities that have occurred in the process instance thus far. Recurrent neural networks and its subclasses, such as Gated Recurrent Unit (GRU) and Long Short-Term Memory (LSTM), have been demonstrated to be able to learn relevant temporal features for subsequent classification tasks. In this paper we apply recurrent neural networks to classifying process instances. The proposed model is trained in a supervised fashion using labeled process instances extracted from event log traces. This is the first time we know of GRU having been used in classifying business process instances. Our main experimental results shows that GRU outperforms LSTM remarkably in training time while giving almost identical accuracies to LSTM models. Additional contributions of our paper are improving the classification model training time by filtering infrequent activities, which is a technique commonly used, e.g., in Natural Language Processing (NLP).Peer reviewe
DeepAlign: Alignment-based Process Anomaly Correction using Recurrent Neural Networks
In this paper, we propose DeepAlign, a novel approach to multi-perspective
process anomaly correction, based on recurrent neural networks and
bidirectional beam search. At the core of the DeepAlign algorithm are two
recurrent neural networks trained to predict the next event. One is reading
sequences of process executions from left to right, while the other is reading
the sequences from right to left. By combining the predictive capabilities of
both neural networks, we show that it is possible to calculate sequence
alignments, which are used to detect and correct anomalies. DeepAlign utilizes
the case-level and event-level attributes to closely model the decisions within
a process. We evaluate the performance of our approach on an elaborate data
corpus of 252 realistic synthetic event logs and compare it to three
state-of-the-art conformance checking methods. DeepAlign produces better
corrections than the rest of the field reaching an overall score of
across all datasets, whereas the best comparable state-of-the-art
method reaches
Encoding conformance checking artefacts in SAT
Conformance checking strongly relies on the computation of artefacts, which enable reasoning on the relation between observed and modeled behavior. This paper shows how important conformance artefacts like alignments, anti-alignments or even multi-alignments, defined over the edit distance, can be computed by encoding the problem as a SAT instance. From a general perspective, the work advocates for a unified family of techniques that can compute conformance artefacts in the same way. The prototype implementation of the techniques presented in this paper show capabilities for dealing with some of the current benchmarks, and potential for the near future when optimizations similar to the ones in the literature are incorporated.Peer ReviewedPostprint (author's final draft
Encoding conformance checking artefacts in SAT
Conformance checking strongly relies on the computation of artefacts, which enable reasoning on the relation between observed and modeled behavior. This paper shows how important conformance artefacts like alignments, anti-alignments or even multi-alignments, defined over the edit distance, can be computed by encoding the problem as a SAT instance. From a general perspective, the work advocates for a unified family of techniques that can compute conformance artefacts in the same way. The prototype implementation of the techniques presented in this paper show capabilities for dealing with some of the current benchmarks, and potential for the near future when optimizations similar to the ones in the literature are incorporated.Peer ReviewedPostprint (author's final draft
Exploiting Event Log Event Attributes in RNN Based Prediction
In predictive process analytics, current and historical process data in event logs are used to predict future. E.g., to predict the next activity or how long a process will still require to complete. Recurrent neural networks (RNN) and its subclasses have been demonstrated to be well suited for creating prediction models. Thus far, event attributes have not been fully utilized in these models. The biggest challenge in exploiting them in prediction models is the potentially large amount of event attributes and attribute values. We present a novel clustering technique which allows for trade-offs between prediction accuracy and the time needed for model training and prediction. As an additional finding, we also find that this clustering method combined with having raw event attribute values in some cases provides even better prediction accuracy at the cost of additional time required for training and prediction.Peer reviewe
- …